Class-Based Delta-Encoding: A Scalable Scheme for Caching Dynamic Web Content

Author

  • Konstantinos Psounis
Abstract

Caching static HTTP traffic in proxy-caches has reduced bandwidth consumption and download latency. However, web-caching performance is hard to increase further because of the growing number of non-cachable dynamic web-documents. Delta-encoding is a promising technique that exploits temporal correlation among different snapshots of a dynamic document and renders dynamic traffic cachable. It does so by combining a cachable previous snapshot of a document, called the base-file, with a small difference-file, called the delta, to generate the current snapshot of the document. However, it has not yet been deployed because of significant scalability concerns about the storage required for base-files on the server-side. In this paper we introduce class-based delta-encoding, a scalable scheme for performing delta-encoding on dynamic web-traffic. The idea is to group documents into classes and store one document per class on the server-side. The proposed scheme thus exploits both temporal correlation within a dynamically evolving document and spatial correlation among different documents. Finally, we present an architecture for deploying the scheme that is transparent to clients, proxy-caches, and web-servers. Experimental results show that class-based delta-encoding combined with compression reduces bandwidth consumption by a factor of 30, and the latency perceived by most users by a factor of 10 on average, without suffering from enormous storage requirements on the server-side.

Keywords: web-caching, dynamic document, delta-encoding, delta, base-file, grouping, privacy.
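The base-file plus delta mechanism described in the abstract can be illustrated with a minimal sketch. It is not the paper's differencing algorithm or wire format, which the abstract does not specify. It assumes a character-level diff computed with Python's standard `difflib`, a hypothetical copy/insert instruction list as the delta, and `zlib` for the compression step. The function names `make_delta`, `apply_delta`, and `pack` are illustrative only. In the class-based scheme, `base` would be the single base-file stored for the document's class rather than a per-document snapshot.

```python
import difflib
import json
import zlib


def make_delta(base: str, current: str) -> list:
    """Encode `current` as copy/insert instructions against `base`.

    Assumption: a toy delta format, not the one used in the paper.
    """
    matcher = difflib.SequenceMatcher(a=base, b=current, autojunk=False)
    delta = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Reuse bytes the receiver already holds in its cached base-file.
            delta.append(["copy", i1, i2])
        elif tag in ("replace", "insert"):
            # Ship only the text that is new in the current snapshot.
            delta.append(["insert", current[j1:j2]])
        # "delete" needs no instruction: the base range is simply skipped.
    return delta


def apply_delta(base: str, delta: list) -> str:
    """Rebuild the current snapshot from the cached base-file and a delta."""
    parts = []
    for op in delta:
        if op[0] == "copy":
            parts.append(base[op[1]:op[2]])
        else:
            parts.append(op[1])
    return "".join(parts)


def pack(delta: list) -> bytes:
    """Serialize and compress a delta for transfer (delta plus compression)."""
    return zlib.compress(json.dumps(delta).encode("utf-8"))
```

A receiver that holds the cached base-file would call `apply_delta(base, delta)` on the unpacked delta to obtain the current snapshot. When two snapshots differ only in a small region, the packed delta is much smaller than the full document, which is the source of the bandwidth saving the abstract refers to.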


Similar resources

Evaluation of ESI and Class-Based Delta Encoding

The portion of web traffic attributed to dynamic web content is substantial and continues to grow as users expect more personalization and tailored information. Unfortunately, dynamic content is costly to generate. Moreover, traditional web caching schemes are not very effective for dynamically-created pages. In this paper we study two acceleration techniques for dynamic content. The first tech...


Evaluation of Delivery Techniques for Dynamic Web Content

The portion of web traffic attributed to dynamic web content is substantial and continues to grow as users expect more personalization and tailored information. Unfortunately, dynamic content is costly to generate. Moreover, traditional web caching schemes are not very effective for dynamically-created pages. In this paper we study two new acceleration techniques for dynamic content. The first ...


HPP: HTML Macro-Preprocessing to Support Dynamic Document Caching

A number of techniques are available for reducing latency and bandwidth requirements for resources on the World Wide Web, including caching, compression, and delta-encoding [12]. These approaches are limited: much data on the Web is dynamic, for which traditional caching is of limited use, and delta-encoding requires both a common version base against which to apply a delta and the complete gen...


Class-Oriented Page Invalidation for Caching Dynamic Web Content

Caching dynamic pages at a server is beneficial in reducing server resource demands and it also helps dynamic page caching at proxy sites. Previous work has used fine-grain dependence graphs among individual dynamic pages and underlying data sets to enforce result consistency. Such an approach can be cumbersome or inefficient for a Web site to manage a cache in dealing with an arbitrarily large numb...


Scaling Java-based Dynamic Web Services: A Second-Year Project Proposal

A popular new class of Web-based services generates dynamic content and does not benefit from traditional static content caching. To improve the performance of these dynamic services, we must develop techniques to make them more scalable, reliable, and available. My approach to achieving better performance is to design a service cache that replicates the code and underlying data used to create t...




Publication date: 2002